首页> 外文OA文献 >A Potential Reduction Algorithm for Two-person Zero-sum Mean Payoff Stochastic Games

【2h】

A Potential Reduction Algorithm for Two-person Zero-sum Mean Payoff Stochastic Games

机译：一种二人零和均值收益的潜在约简算法随机游戏

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We suggest a new algorithm for two-person zero-sum undiscounted stochasticgames focusing on stationary strategies. Given a positive real $\epsilon$, letus call a stochastic game $\epsilon$-ergodic, if its values from any twoinitial positions differ by at most $\epsilon$. The proposed new algorithmoutputs for every $\epsilon>0$ in finite time either a pair of stationarystrategies for the two players guaranteeing that the values from any initialpositions are within an $\epsilon$-range, or identifies two initial positions$u$ and $v$ and corresponding stationary strategies for the players provingthat the game values starting from $u$ and $v$ are at least $\epsilon/24$apart. In particular, the above result shows that if a stochastic game is$\epsilon$-ergodic, then there are stationary strategies for the playersproving $24\epsilon$-ergodicity. This result strengthens and provides aconstructive version of an existential result by Vrieze (1980) claiming that ifa stochastic game is $0$-ergodic, then there are $\epsilon$-optimal stationarystrategies for every $\epsilon > 0$. The suggested algorithm is based on apotential transformation technique that changes the range of local values atall positions without changing the normal form of the game.

机译：我们为集中于固定策略的两人零和无折扣随机游戏提出了一种新算法。给定真实的真实\ epsilon $，如果它的任意两个初始位置的值相差最多$ epsilon，则letus称为随机游戏$ \ epsilon $遍历。提议的新算法在有限时间内每$ \ epsilon> 0 $输出，或者两个参与者的一对固定策略保证任何初始位置的值都在$ \ epsilon $范围内，或者标识两个初始位置$ u $和$ v $和玩家相应的固定策略证明，从$ u $和$ v $开始的游戏价值至少为$ \ epsilon / 24 $。特别地，以上结果表明，如果随机游戏是\\ epsilon $遍历的，那么就有固定的策略让玩家证明$ 24 \ epsilon $遍历。这个结果加强了Vrieze（1980）的存在性结果，并提供了一个建设性的版本。Vrieze认为，如果一个随机博弈是$ 0 $遍历的，那么对于$> epsilon≥$$，就有$ \ epsilon $-最优静态策略。所建议的算法基于电位变换技术，该技术可在所有位置更改局部值的范围而无需更改游戏的正常形式。

著录项

作者
Boros, Endre; Elbassioni, Khaled; Gurvich, Vladimir; Makino, Kazuhisa;
展开▼
作者单位

展开▼
年度 2015
总页数
原文格式 PDF
正文语种
中图分类

相似文献

外文文献
中文文献
专利

1. A Potential Reduction Algorithm for Two-Person Zero-Sum Mean Payoff Stochastic Games [J] . Boros Endre, Elbassioni Khaled, Gurvich Vladimir, Dynamic games and applications . 2018,第1期

机译：两个人零汇率平均支付随机游戏的潜在减少算法
2. Two-Person Zero-Sum Stochastic Games with Semicontinuous Payoff [J] . R. Laraki, A.P. Maitra, W.D. Sudderth Dynamic games and applications . 2013,第2期

机译：具有半连续收益的两人零和随机游戏
3. Two-Person Zero-Sum Stochastic Games with Semicontinuous Payoff [J] . R. Laraki, A.P. Maitra, W.D. Sudderth Dynamic games and applications . 2013,第2期

机译：具有半连续收益的两人零和随机游戏
4. Stochastic Recursive Zero-Sum Differential Game and Mixed Zero-Sum Differential Game Problem with Payoff Functional in BDSDES [C] . Renwei Jia, Lifeng Wei, Xiaodong Liu IEEE International Conference of Safe Production and Informatization . 2020

机译：随机递归零和差动游戏和BDSDES的收益功能混合零和差分游戏问题
5. On payoff allocations for assignment games and on algorithms for stochastic games. [D] . Brugueras, Jaime. 2006

机译：关于分配游戏的收益分配和关于随机游戏的算法。
6. Zero-Sum Matrix Game with Payoffs of Dempster-Shafer Belief Structures and Its Applications on Sensors [O] . Xinyang Deng, Wen Jiang, Jiandong Zhang 2017

机译：具有Dempster-Shafer信念结构收益的零和矩阵博弈及其在传感器中的应用
7. Policy Iteration Algorithms for Zero-Sum Stochastic Differential Games with Long-Run Average Payoff Criteria [O] . José Daniel López-Barrientos 2014

机译：零汇率随机差动游戏的政策迭代算法，长期平均收益标准
8. On a Class of Two-Person, Zero-Sum Games with Vector-Valued Payoff Functions [R] . Chattopadhyay, R. 1970

机译：一类具有向量值支付函数的两人零和游戏

A Potential Reduction Algorithm for Two-person Zero-sum Mean Payoff Stochastic Games

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅